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^ I Abstract 

In this two-part paper, we address the problem of finding the optimal precoding/multiplexing 
scheme for a set of non-cooperative links sharing the same physical resources, e.g., time and band- 
width. We consider two alternative optimization problems: P.l) the maximization of mutual in- 
formation on each link, given constraints on the transmit power and spectral mask; and P.2) the 
O . maximization of the transmission rate on each link, using finite order constellations, under the same 

constraints as in P.l, plus a constraint on the maximum average error probability on each link. 
Aiming at finding decentralized strategies, we adopted as optimality criterion the achievement of a 
Nash equilibrium and thus we formulated both problems P.l and P.2 as strategic noncooperative 
, (matrix-valued) games. In Part I of this two-part paper, after deriving the optimal structure of the 

, linear transceivers for both games, we provided a unified set of sufficient conditions that guarantee 

I the uniqueness of the Nash equilibrium. In this Part II, we focus on the achievement of the equilib- 

, rium and propose alternative distributed iterative algorithms that solve both games. Specifically, the 

new proposed algorithms are the following: 1) the sequential and simultaneous iterative waterfill- 
ing based algorithms, incorporating spectral mask constraints; 2) the sequential and simultaneous 
gradient projection based algorithms, establishing an interesting link with variational inequality 
problems. Our main contribution is to provide sufficient conditions for the global convergence of 
all the proposed algorithms which, although derived under stronger constraints, incorporating for 
example spectral mask constraints, have a broader validity than the convergence conditions known 
in the current literature for the sequential iterative waterfilling algorithm. 



1 Introduction and Motivation 

The goal of this two-part paper is to find the optimal precoding/multiplexing schemes for a set of 
non-cooperative links sharing the same physical resources, e.g. time and bandwidth. Two alterna- 
tive optimization problems are considered [T]: P.l) the maximization of mutual information on each 
link, given constraints on the transmit power and spectral emission mask, imposed by radio spectrum 
regulatory bodies; and P.2) the maximization of the transmission rate on each link, using finite or- 
der constellations, under the same constraints as in P.l, plus a constraint on the maximum average 

*This work was supported by the SURFACE project funded by the European Community under Contract IST-4- 
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error probability on each link. We focus on decentralized strategies to avoid coordination among the 
separated links and the heavy signaling required by a global controller that would need to collect all 
relevant information from all the users. The search for decentralized solutions motivated our formu- 
lation within the convenient framework of game theory. We thus adopt as optimality criterion the 
achievement of a Nash Equilibrium (NE) [2] and we cast both optimization problems P.l and P.2 as 
strategic non-cooperative (matrix- valued) games [Ij, where the goal of each user is to optimize its own 
precoding/multiplexing matrix. In Part I of this two-part paper [U Theorem 1], we proved that there is 
no performance loss in reducing both original matrix-valued games into a unified vector-valued power 
control game, where the optimal strategy of each user corresponds to finding the power allocation 
that maximizes its own (information) rate, treating the multiuser interference due to the other users 
as additive colored noise. We will refer to this power control game as rate-maximization game. In 
Part I, we proved that the solution set of the rate- maximization game is always nonempty and derived 
(sufficient) conditions that guarantee the uniqueness of the NE [1] Theorem 2]. In this Part II, we 
propose alternative algorithms that reach the Nash equilibria of the unified vector game, in a totally 
distributed manner. 

All the distributed algorithms used to compute the Nash equilibria of a (rational [2j) strategic 
non-cooperative game are based on a simple idea: Each player optimizes iteratively its own payoff 
function following a prescribed updating schedule, for example, simultaneously with the other users 
(i.e., according to a Jacobi scheme [3]), or sequentially (i.e., according to a Gauss-Seidel scheme [3]). 
Differently from the optimization of a single-user system, where the optimal transceiver structure can 
be obtained in a single shot (depending on the interference scenario [3]-[U]), in a competitive multiuser 
context like a game, it is necessary to adopt an iterative algorithm, as each user's choice affects the 
interference perceived by the other users. However, the competitive nature of the multiuser system 
does not guarantee in general the convergence of such an iterative scheme, even if the payoff function 
of each player is strictly concave (or strictly convex) in its own strategies and the NE is unique. This 
issue motivated several works in the literature [7] -[11], |21j-|29j. where alternative approaches have been 
proposed to study the convergence of iterative algorithms in strategic non-cooperative games. 

A traditional approach comes from classical scalar power control problems in flat-fading CDMA 
(or TDMA/FDMA) wireless networks (either cellular or ad-hoc) [7]-|16)P] where each user has only 
one variable to optimize: its transmit power. This kind of problems can be elegantly recast as convex 
optimization problems (see, e.g., [IS]- [2^) or as the so called "standard" problems (in the sense of 
[5]-|llj). for which distributed (either synchronous or asynchronous) algorithms along with their con- 
vergence properties are available [7]-|16j. |2Uj . The rate-maximization game proposed in this paper is 
more involved, as it falls in the class of vector power control problems, where each player has a vector 
to optimize (i.e., its power allocation across frequency bins) and the best-response function of each 
user (the waterfilling mapping) is not a standard function (in the sense of [9]-|llj). Hence, the classical 
framework of [7j-[Il] cannot be successfully applied to our game theoretical formulation. 

^Note that, even though some of these papers do not contain any expUcit reference to game theory, the problems 
therein can be naturally reformulated as a strategic non-cooperative game, where the Nash equilibria are the fixed-points 
of proper best response correspondences. 



A special case of the rate-maximization game proposed in this paper was studied in [2T] in the 
absence of spectral mask constraints, where the authors formulated the vector power control problem 
for a Digital Subscriber Line (DSL) system, modelled as a Gaussian frequency-selective interference 
channel, as a two-person strategic non-cooperative game. To reach the Nash equilibria of the game, 
the authors proposed the sequential Iterative Waterfilling Algorithm (IWFA), which is an instance 
of the Gauss-Seidel scheme p]: The users maximize their own information rates sequentially (one 
after the other), according to a fixed updating order. Each user performs the single-user waterfilling 
solution given the interference generated by the others as additive (colored) noise. The most appealing 
features of the sequential IWFA are its low-complexity and its distributed nature. In fact, to compute 
the waterfilling solution, each user only needs to measure the noise-plus-interference Power Spectral 
Density (PSD), without requiring specific knowledge of the power allocations and the channel transfer 
functions of the other users. The convergence of the sequential IWFA has been studied in a number 
of works |22j-[28j. each time obtaining milder conditions that guarantee convergence. However, despite 
its appealing properties, the sequential IWFA suffers from slow convergence if the number of users in 
the network is large, because of the sequential updating strategy. In addition, the algorithm requires 
some form of central scheduling to determine the order in which users update their strategy. 

The original contributions of this paper with respect to the current literature on vector games |21j - 
[26j are listed next. First, to compute the Nash equilibria of both games P.l and P.2 (introduced in 
Part I IT]), we generalize the sequential IWFA of [21], including the spectral mask constraint and a 
possible memory in the updating process. Then, to overcome the potential slow convergence rate of the 
sequential IWFA, we propose a new iterative algorithm, called simultaneous IWFA. The simultaneous 
IWFA is an instance of the Jacobi scheme f3]: At each iteration, all users update their own strategies 
simultaneously, still according to the single-user waterfilling solution, but using the interference gen- 
erated by the others in the previous iteration. We provide results on the convergence speed of both 
algorithms, showing that the simultaneous IWFA is faster than the sequential IWFA, still keeping the 
desired properties of the sequential IWFA, i.e., its distributed nature and low complexity. The second 
important contribution of the paper is to provide a unified set of sufficient conditions ensuring the global 
convergence of both algorithms. Our conditions are proved to have broader validity than those given 
in |21j-|25j. |26j (obtained without mask constraints) and, more recently, in [27] (obtained including 
mask constraints) for the sequential IWFA. Moreover, they show that the range of applicability with 
guaranteed convergence of both sequential and simultaneous IWFAs includes scenarios where the inter- 
fering users may be rather close to each other. Finally, exploring the link between the Nash equilibria 
of our game theoretical formulation and the solutions to the so-called variational inequality problems 
[30]- [32], we propose, as alternative to the IWFAs, two novel gradient projection based iterative al- 
gorithms, namely the sequential and simultaneous Iterative Gradient Projection Algorithms (IGPAs) 
and provide conditions for their global convergence. 

Throughout the paper, there is a common thread relating the algorithms and the derivation of their 
convergence conditions: The interpretation of the waterfilling operator as the Euclidean projector of a 
vector onto a convex set. In the single-user case, this provides an alternative perspective of the well- 
known waterfilling solution, that dates back to Shannon in 1949 [33]. Interestingly, in the multiuser 



case, this interpretation plays a key role in proving the convergence of the proposed algorithms. 

The paper is organized as follows. After briefly reviewing, in Section [21 the game theoretic formu- 
lation addressed in Part I of the paper [H Theorem 2] , Section [3] provides the interpretation of the 
waterfilling operator as a projector. Section H] contains the main contribution of the paper: A variety 
of distributed algorithms for the computation of the Nash equilibria of the game, along with their 
convergence properties. Finally, in Section [5l some conclusions are drawn. Preliminary versions of this 
paper appeared in [251 [28l [29]. 

2 System Model and Problem Formulation 

We consider a Gaussian vector interference channel [35], composed by Q non-cooperative links. Aiming 
at finding distributed algorithms, we focus on transmission techniques where no interference cancellation 
is performed and multiuser interference is treated as additive colored noise. Moreover, we consider a 
block transmission without loss of generality (w.l.o.g.), as it is a capacity-lossless strategy for sufficiently 
large block length [36]-[38]- Then, under assumptions detailed in Part I [IJ, the system design consists 
in finding the optimal transmit/receive matrix pair for each link independently of the others, according 
to some performance metrics. In Part I of this paper [Tj, we assumed as optimality criterion the 
achievement of the NE and considered the two following strategic non-cooperative games: 

P.l The maximization of mutual information on each link, given constraints on the transmit power 
and on the spectral radiation mask; 

P.2 The maximization of the transmission rate on each link, using finite order constellations, under 
the same constraints as in P.l plus a constraint on the average (uncoded) error probability. 

After showing that the solution set of both games is always nonempty, in [H Theorem 1] we proved 
that the optimal transmission strategy for each link leads to Gaussian signaling plus the diagonal 
transmission through the channel eigenmodes (i.e., the frequency subchannels), irrespective of the 
channel state, power budget, spectral mask constraints and interference levels. Thanks to this result, 
both matrix-valued games P.l and P.2 can be recast, with no performance loss, as the following simpler 
vector power control game [H Theorem 1]: 

maximize i?„(p„,p_„) 
(^) : P« , G (1) 

subject to pg G 

where = {1, 2, . . . , Q} is the set of players (i.e., active links), is the set of admissible strategies of 
player q j^l 

^,= |p,G K^:^ E?^'?(^) = l' ^<P,{k)<p^^^{k\ VA;G{l,...,iV}|, (2) 



^In order to avoid the trivial solution p*(fc) = p^^'^'ik) for all fc G {1, . . . , A^}, (1/iV) J2k=i Pq"^"" i^) > 1 is assumed for 
all g £ SI. Furthermore, in the feasible strategy set of each player, we can replace, w.l.o.g., the original inequality power 
constraint (1/A'') "^k^iPiW ^ li with equality, since, at the optimum, this constraint must be satisfied with equality 
from all users. 



where p™^^{k) = p^^^{k) / Pq, with p^^^{k) denoting the maximum power that is allowed to be allocated 
on the fc-th frequency bin from the g-th user, and -Rq(pg,p_g) is the payoff function of player q : 



1^/1 \ 



(3) 
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where Hrq{k) = Hrq{k)^Pr/ (c^ Hrq{k) denotes the frequency-response on the subcarrier k of 
the channel between source r and destination q, d^q is the distance between source r and destination q, 
and 7 is the path loss. The SNR gap Tq in ([3]) is set equal to 1 if game in P.l is considered, whereas 
Tq = (Q~n-Pe*g/4))V3 139J, if we consider P.2, where Q(-) denotes the Q-function [34] and P^^ is the 
maximum tolerable (uncoded) average symbol error probability on link q. 

In [H Theorem 2], we showed that the solution set of ^ is always nonempty and coincides with the 
solution set of the following nonlinear fixed-point equation: 

p^ = WF,(pt,...,p^_„p^+i,...,p^) =WF,(p%) , V(?eO, (5) 

with the waterfilling operator WF^ (•) defined as 

^ + Y.r^q\Hrq{k)\'pr{k) 



[WF,(p_,)],^ 



1 2 _ P«*"''('=) 



\Hqq{k)\' 



{!,..., iV}, (6) 



where denotes the Euclidean projection of x onto the interval [a, 6] If] The water-level is chosen 
to satisfy the power constraint (1/A^) SfcLiPg(^) = 1- 

Observe that system ([5]) contains, as special cases, the solutions to power control games already 
studied in the literature |21j-|26j. when all the players are assumed to transmit with the same power 
and no spectral mask constraints are imposed (i.e., when p^^^{k) = +oo, \/q, VA;). In this case, the 
Nash equilibria of game ^ are given by the classical simultaneous waterfilling solutions |21j-|26j. where 
WFg (•) in (0) is still obtained from ^ simply setting p^^^[k) = +oo, Vg, Mk. However, in the presence 
of spectral mask constraints, the results of [2T]-[26] cannot be applied to system In Part I of this 
paper [H Theorem 2] , we studied system ([5]) and provided sufficient conditions for the uniqueness of the 
solution. The problem we address here is how to reach solutions to ([5]) (leading to the Nash equilibria 
of ^) by means of totally distributed algorithms. 

3 Waterfilling Operator as a Projector 

In this section we provide an interpretation of the waterfilling operator as a proper Euclidean projector. 
This interpretation will be instrumental to prove the convergence properties of some of the algorithms 
proposed in the subsequent sections. 



^The Euclidean projection [x]^ is defined as follows: [x\^ = a, ii x < a, [x]^ = x, li a < x < b, and [x\^ = b, ii x > b. 



3.1 A new look at the single-user waterfilling solution 

Consider a parallel additive colored Gaussian noise channel composed of subchannels with coefficients 
{H{k)}, subject to some spectral mask constraints {p^^^{k)}^^i and to a global average transmit power 
constraint across the subchannels. It is well-known that the capacity-achieving solution for this channel 
is obtained using independent Gaussian signaling across the subchannels with the following waterfilling 
power allocation [JO] 



P- 



-I 



\H{k)\ 







A; G {!,..., AT}, (7) 



where cr^ denotes the noise variance on the k-ih subchannel, p*{k) is the optimal power allocation 
over the k-th subchannel. The water-level /x in ([7]) is chosen in order to satisfy the power constraint 
(l/A^)Ef=iP^(^) = l- 

We show now that, interestingly, the solution in d?]) can be interpreted as the Euclidean projection 
of the vector — insr, defined as 



\nsr^[al/\H{l)\\...,a%/\H{N)\'f (8) 



onto the simplex 



S 



A 



1 ^ 1 



k=l 



Lemma 1 The Euclidean projection of the N -dimensional real nonpositive vector — xq = — [xo,i, • • • , a^o,Af] 
onto the simplex S defined in denoted by [—xq]^, is by definition the solution to the following convex 
optimization problem: 



minimize llx — (— xo)||2 

X 

subject to < xk<p^''''{k), VA; e {1, . . . , A^} 

N 



(10) 



1 ^ 



N 

k=l 



and assumes the following form: 



4=[/^-^o,fc]^''^'^ A.G{i,...,A^}, (11) 

where ii > is chosen in order to satisfy the constraint (l/N) J2k=i ^k ~ 

Proof. See Appendix El ■ 
Lemma[T]is an extension of [46^ Lemma 1] to the case where interval bounds [0, p"^^^{k)] are included 
in the optimization. But what is important to remark about Lemma [U (and this is a contribution of 
this paper) is that it allows us to interpret the waterfilling operator as a projector, according to the 
following corollary. 

Corollary 1 The waterfilling solution p* = [p*(l), . . . ,p*{N)]^ in ^ can be expressed as the projec- 
tion of —insr given in (0j onto the simplex S in (^: 

p'^ = [-insr]5. (12) 



Corollary 2 The waterfiUing solution in the form 



(7t 



\H{kW 



kG{l,...,N}, 



(13) 



where w = [wi, . . . jWjyj] is any positive vector, can be expressed as the projection with respect to the 



weighted Euclidean nor with weights wi, . . . ,wn, of — insr given in (0j onto the simplex S in 



-insr 



(14) 



The graphical interpretation of the waterfilling solution as a Eudidean projector, for the single- 
carrier two-user case, is given in Figure [T) For any insr = (insri, insr2) corresponding to a point in 
the interior of the gray region (e.g., point A), the waterfilling solution allocates power over both the 
channels. If, instead, the vector insr is outside the gray region (e.g., point B), all the power is allocated 
only over one channel, the one with the highest normalized gain. 




Figure 1: Graphical interpretation of waterfilling solution ((7| as a projection onto the two-dimensional simplex. 



3.2 Simultaneous multiuser waterfilling 

In the multiuser scenario described in game 5^, the optimal power allocation of each user depends on 
the power allocation of the other users through the received interference, according to the simultaneous 
multiuser waterfilling solution in ([5]). 

As in the single-user case, introducing the vector insrg(p_g), defined as 

[insr,(p_,)], 4 r,^l^^^i-^^i/^^, G {1, . . . , AT}, (15) 



and invoking Lemma [TJ we obtain the following corollary. 

"^The weighted Euclidean norm HxHj ^ is defined as ||x||2 ^ — (X^i 



|2xl/2 



Corollary 3 The waterfiUing operator\Nfq (p-<j) in can be expressed as the projection o/— insrg(p_g) 
defined in U5\) onto the simplex given in 



Comparing (0) with (|16|) . it is straightforward to see that all the Nash equilibria of game ^ can be 
alternatively obtained as the fixed-points of the mapping defined in (jl6p . whose existence is guaranteed 
by [H Theorem 2]: 



In Appendix IHl we provide the key properties of the mapping in (|16p . that will be instrumental to 
obtain sufficient conditions for the convergence of the distributed iterative algorithms based on the 
waterfilling solution and described in Section 14. 1[ 

4 Distributed Algorithms 

In [U Theorem 2], we proved that, under some (sufficient) conditions on transmit powers, channels and 
network topology, the NE for game 5^ is unique. Since there is no reason to expect a system to be 
initially at the equilibrium, the concept of equilibrium has a useful meaning in practice only if one is 
able to find a procedure that reaches such an equilibrium from non-equilibrium states. In this section, 
we focus on algorithms that converge to these equilibria. 

Since we are interested in a decentralized implementation, where no signaling among different users 
is allowed, we consider only totally distributed iterative algorithms, where each user acts independently 
of the others to optimize its own power allocation while perceiving the other users as interference. 
The main issue of this approach is to guarantee the convergence of such an iterative scheme. In the 
following, we propose two alternative classes of totally distributed iterative algorithms along with their 
convergence properties, namely: iterative algorithms based on the waterfilling solution ([6]), and iterative 
algorithms based on the gradient projection mapping. 

4.1 Distributed algorithms based on waterfilling 

So far, we have shown that the Nash equilibria of game are fixed-points (see ([5])) of the waterfilling 
mapping defined in ([6]). Hence, to achieve these solutions by a distributed scheme, it is natural to employ 
an iterative algorithm based on the best response Based on this idea, we consider two classes 
of iterative algorithms: sequential algorithms, where the users update their strategies sequentially 
according to a given schedule; and simultaneous algorithms, where all the users update their strategies 
at the same time. In the following sections, we provide a formal description of both algorithms and 
derive the conditions guaranteeing their convergence to the unique NE of the game. 

Before describing the proposed algorithms, we introduce the following intermediate definitions. 
Given game 5^, let C {1, • • • , N} denote the set {!,..., N} (possibly) deprived of the carrier indices 
that user q would never use as the best response set to any strategies adopted by the other users, for 
the given set of transmit power and propagation channels [1]: 




(16) 



Pg = [-'nsrg(p%)]^ , yqen. 



(17) 



Vq^{k£{l,...,N}:3p_q£ such that [WFg (p_g)]^ / 0} 



(18) 



with WFg (•) defined in ([6]) and 4 x • • • x x ^<^+i x • • • x ^q. In Part I of tlie paper [T], 

we provided an iterative procedure to estimate sucli a set. We also introduce tlie matrix H™**^ € M*^^'^, 
defined as 



-n iHrqik)]"^ dqq Pj. 

Tq rnax - ,,,,^ -^7^, if r ^ q, 



kdV^nVr \Hqq{k)\'^ (Trq Pq ' (19) 

0, otlierwise, 



witli tlie convention that the maximum in (|19p is zero if Vq fl 2?^. is empty. 
4.1.1 Sequential iterative waterfilling algorithm revisited 

The sequential IWFA is an instance of the Gauss-Seidel scheme [3]: All users update their own strategies 
sequentially, performing the waterfilling solution ([5]). The algorithm is described in Algorithm 1. 

Algorithm 1: Sequential Iterative Waterfilling Algorithm 

Set \)q^ = any feasible power allocation, Mq G 



for n = : Nit 



! I WF„fp^"jV if (n+ l)modg = g, 

' Pq , otherwise. 



end 



The convergence of the algorithm is guaranteed under the following sufficient conditions. 

Theorem 1 Assume that the following condition is satisfied: 

p(H^^^)<l, (CI) 

where H™'^^ is defined in il9\) and p(H™^^) denotes the spectral radiu^ of the matrix H™'^^. Then, as 
Nit — > CX3, the sequential IWFA described in Algorithm^ converges linearly to the unique NE of game 
, for any set of initial conditions belonging to =^ and for any updating schedule. 

Proof. See Appendix O ■ 

Remark 1 — Global convergence and uniqueness of the NE. Even though the optimization 
problem ([1]) is nonlinear, condition (|Cip guarantees the global convergence of the sequential IWFA, 
irrespective of the specific users' updating order. Moreover, the global asymptotic stability of the NE 
implies also the uniqueness of the equilibrium. Condition (jCip indeed coincides with the uniqueness 
condition given in [H Corollary 1]. 

To give additional insight into the physical interpretation of sufficient conditions for the convergence 
of the sequential IWFA, we provide the following corollaries of Theorem [TJ 



^The spectral radius p (H) of the matrix H, is defined as p(H) = max{|A| : A G eig(H)}, with eig (H) denoting the 
set of eigenvalues of H [44, . 



Corollary 4 A sufficient conditions for iCl\) is given by one of the two following set of conditions: 
i E me.^_ ji^l < 1, W e (C3) 



where w = [wi, . . . , wq]^ is any positive vector. 



Corollary 5 The best vector w in liC^} - [C3\} is given by the solution to the following geometric pro- 
gramming problem 



where Grq is defined as 



minimize t 

subject to Grqt^^W^^Wr < 1, V(/, (^21) 

r=l,r^q 

w > 0, t > 0, 

G.,*r,,_a» (22) 



keVrCTDq \ \Hqq{k)\'^ J <£}.q 



qWq 



if / fC^p is used, or as 



if < fC3|) is used. 



dPrrPqWq 



^^'^ ^ kf^vfnVg \ \Hrrik)\'^ J dqr Pr Wr' ^ "^"^ 



Note that, as by direct product of the proof of Theorem [H one can always choose the full set 
Dq = {1, . . . , N} in (|Cip and ()C2p - (|C3p . However, less stringent conditions are obtained by removing 
the unnecessary carriers, i.e., the carriers that, for the given power budget and interference levels, are 
never going to be used. 

Remark 2 — Physical interpretation of convergence conditions. As already shown in Part 
I of the paper ^ for the uniqueness conditions of the NE, the convergence of sequential IWFA is 
guaranteed if the interferers are sufficiently far apart from the destinations. In fact, from (jCip or ()C2p - 
(|C3p . one infers that, for any given set of channel realizations and power constraints, there exists a 
distance beyond which the sequential IWFA is guaranteed to converge, corresponding to the maximum 
level of interference that may be tolerated by each receiver (as quantified, e.g., in ()C2p ) or that may 
be generated by each transmitter (as quantified, e.g., in (|C3P ). Interestingly, the presence of spectral 
mask constraints does not affect the convergence capability of the algorithm. Moreover, convergence 
condition (jCip (or ()C2p - ()C3p i has the same desired properties as the uniqueness conditions obtained 
in Part I of the paper: It is robust against the worst normalized channels \Hrq{k)\'^ /\Hqq{k)\'^ , since the 
subchannels corresponding to the highest ratios \Hrq{k)\'^ /\Hqq{k)\'^ (and, in particular, the subchannels 
where \Hqq{k)\'^ is vanishing) do not necessarily affect (jCip (or (jC2p - (jC3p ). as their subcarrier indices 
may not belong to the set Dq. This strongly relaxes the convergence conditions. 



We can generalize the sequential IWFA given in Algorithm [T] by introducing a memory in the 
updating process, as given in Algorithm [2l We call this new algorithm smoothed sequential IWFA. 



Algorithm 2: Smoothed Sequential Iterative Waterfilling Algorithm 



Set pq = any feasible power allocation and Ug E [0, l),\/q £ Q; 
for n = : Nit 




a,pS") + (1 - ag)WF, (pL';^) , if (n + 1) mod Q = q, 



(24) 



Pq , otherwise, 



end 



Each factor aq € [0, 1) in Algorithm [2] can be interpreted as a forgetting factor: The larger is aq, 
the longer is the memory of the algorithm. The choice of each aq depends on the channel stationarity 
and on possible channel fluctuations or estimation errors. If the channel is fixed or highly stationary 
and there are channel estimation errors that induce zero mean fluctuations on the allocated power, it 
is convenient to take each Oq close to 1, to smooth out the undesired fluctuations. Conversely, when 
the channel is rapidly varying, it is better to take a small aq. Interestingly, the choice of {aq}q£^ does 
not affect the convergence property of the algorithm, as proved in the following. 

Theorem 2 Assume that conditions of Theorem [7] are satisfied. Then, as Nit — > oo, the smoothed 
sequential IWFA described in Algorithmic converges linearly to the unique NE of game ^ , for any set 
of initial conditions in ^ , updating schedule, and {aq}qi^n, with Oq € [0, 1), V(; € ^l. 

Proof. See Appendix O ■ 

Remark 3 — Comparison with previous results. The sequential IWFA described in Algorithm 
[T] generalizes the well-known sequential iterative waterfilling algorithm originally proposed by Yu et 
al. in [21] and then studied in [22j-[26j. to the case in which the users have (possibly) different power 
budgets and there are spectral mask constraints. In fact, the algorithm in [21j can be obtained as 
a special case of Algorithm [U by removing the spectral mask constraints in each set ,'^q in (i.e., 
setting p^^^{k) = +oo, VA;, q) and replacing the waterfilling operator in ^ with the classical waterfilling 



where {x) = max(0, x) and insr^ is defined in (jlSp . 

The convergence of the sequential IWFA based on the mapping (j25p has been studied in a number 
of works, each time obtaining milder convergence conditions. Specifically, in [21], the authors provided 
sufficient conditions for the existence of a NE and the convergence of the sequential IWFA, for a game 
composed by two players. This was later generalized to an arbitrary number of players in |22j-|25j. 
In [26], the case of flat-fading channels was considered. Interestingly, although derived under stronger 
constraints, incorporating for example spectral mask constraints, our convergence conditions have a 
broader validity than those obtained in |21|-[26j. as shown in the followingl^ 

®We summarize the main results of [21]-[25) using our notation. 



solution 




(25) 



Corollary 6 Sufficient conditions for are f21[ dH [2. 



or H 



|^nj(^)Pl Pj. ^ 1 



''..|r«,iwKlt<2Q33. V.<,^,..fi. (C5) 

In the case of fiat-fading channels (i.e., Hrq{k) = Hj-q, yr,q), condition W^) becomes f26\j 

Q I fV |2 p 

Recently, alternative sufficient conditions for the convergence of sequential IWFA as given in Algo- 
rithm [T] were independently given in Specifically, the sequential IWFA was proved to converge to 
the unique NE of the game if the following condition is satisfiedjfl 

p(T)<l, (C6) 

where /?(T) denotes the spectral radius of the matrix T = (I — H™^'^)""^ HJJ^^p , with H™^^ and HJJjfp 
denoting the strictly lower and strictly upper triangular part of the matrix H™'^^, respectively, with 
jjmax (igfingfi^ in our notation, as in (|19p. where each Vg is replaced by the full set {!,..., N}. 

As an example, in Figure [21 we compare the range of validity of our convergence condition (jCip 
with that of ()C4p and (|C6p . over a set of channel impulse responses generated as vectors composed of 
i.i.d. complex Gaussian random variables with zero mean and unit variance. In the figure, we plot the 
probability that conditions (jCip . ()C4p and ()C6P are satisfied versus the ratio drq/dgq, which measures 
how far apart are the interferers from the destination, with respect to the intended source. In Figure 
[21^ a) we consider a system composed by Q = 5 users, and in Figure[2^b) a system with Q = 15 links. For 
the sake of simplicity, to limit the number of free parameters, we assumed drq = dqr, Pq = Pr Vr, g, and 
w = 1. We tested our condition considering the set Pg, obtained using the algorithm given in [1]. We 
can see, from Figure [21 that the probability of guaranteeing convergence increases as the distance of the 
interferers, normalized to the source-destination distance, increases (i.e., the ratio d^q/dqq increases). 
Interestingly, the probability that (|Cip is satisfied, differently from (|C4p and ()C6p . exhibits a neat 
threshold behavior as it transits very rapidly from the non-convergence guarantee to the almost certain 
convergence, as the ratio drq/dgq increases by a small percentage. This shows that the convergence 
conditions depend, fundamentally, on the interferers distance, rather than on the channel realizations. 
Finally, it is worthwhile noticing that our conditions have a broader validity than ()C4p and ()C6P . As an 
example, for a system with probability of guaranteeing convergence of 0.99 and Q = 15, conditions (jCip 
only require drq/dqq ~ 4.2, whereas conditions ()C4p and ()C6P require drq/dgq > 50 and drq/dgq ~ 40, 
respectively. Furthermore, comparing Figure [2^a) with Figure [2]^b) , one can see that this difference 
increases as the number Q of links increases. 

Remark 4 — Distributed nature of the algorithm. The sequential IWFA as described in Algo- 
rithms [1] and [2] can be implemented in a distributed way, since each user, to maximize its own rate. 



'^We thank Prof. Facchinei, who kindly brought to our attention reference [23, after this paper was completed. 
*We write conditions of [27] using our notation. 
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(a) (b) 
Figure 2: Probability of jCT}, ICit and llC6t versus drq/dqq; Q = 5 [subplot (a)], Q = 15 [subplot (b)], 7 = 2.5, drq = dqr, 

drr = dqq = 1, P, = P^, Fq = 1, Pq/a'^ = 7dB, Pr/(a^d'^q) = 3dB, Vr, q £ w = 1. 



needs only to measure the PSD of the thermal noise plus the overall MUI (see dH)). However, despite 
its appealing properties, the algorithm may suffer from slow convergence if the number of users in the 
network is large, as we will also show in Section [4.1 .Si This drawback is due to the sequential schedule 
in the users' updates, wherein each user, to choose its own strategy, is forced to wait for all the other 
users scheduled before. Moreover, although distributed, both algorithms require that all users share a 
prescribed updating schedule. This requires a centralized synchronization mechanism that determines 
the order and the update times of the users. We show next how to remove these limitations. 



4.1.2 Simultaneous iterative waterfilling algorithm 

To overcome the main limitation of sequential IWFAs given in Algorithms [U and [21 we consider in this 
section the simultaneous version of the IWFA, called simultaneous IWFA. The algorithm is an instance 
of the Jacobi scheme [3j: At each iteration, all users update their own power allocation simultaneously, 
performing the waterfilling solution given the interference generated by the other users in the 
previous iteration. Stated in mathematical terms, the proposed algorithm is described in Algorithm [3] 
[281 [29]. 

Algorithm 3: Simultaneous Iterative Waterfilling Algorithm 

Set pq°^ = any feasible power allocation, Vg G 17; 
for n = : Nit 

p("+^) = WF, (p("\ . . . , p(!.\, pgi, . . . , pj)) , yqen, (27) 

end 



As for the sequential IWFA, also in the simultaneous IWFA we can introduce a memory in the 
updating process and obtain the so-called smoothed simultaneous IWFA, as described in Algorithm [J] 
[281 [29]. 



Algorithm 4: Smoothed Simultaneous Iterative Waterfilling Algorithm 

Set pq^^ = any feasible power allocation and Ug E [0, l),\/q £ U; 
for n = : Nit 

p("+i) = a,pW + (1 - a,)WF, (p("\ . . . , p("_\, pSi, . . . , pj^) , VqeQ, (28) 
end 

Interestingly, both Algorithm [3] and H] are guaranteed to globally converge to the unique NE of the 
game, under the same sufficient conditions of the sequential IWFA, as proved in the following. 

Theorem 3 Assume that conditions of Theorem{l\ are satisfied. Then, as Nit oo, the simultaneous 
IWFAs described in Algorithmic <ind Algorithm^ converge linearly to the unique NE of game ^, for 
any set of initial conditions in ^ and {aq}q^n, with aq € [0, 1), V(7 € Vt. 

Proof. See Appendix [Dj ■ 

Additional (weaker) convergence conditions for Algorithm [3] and H] are given next. Introducing the 
matrix H(/c) G M'^xQ^ defined as 



|H(fc)l„ 4 



^Wlt- ^f^^^.n^.and,,. ^^^^ 
0, otherwise, 
we have the following theorem. 

Theorem 4 Assume that the following conditions are satisfied: 

(H^(A:)H(A:)) < 1, VA: G {1, . . . , A^}, (30) 

where H(A;) is defined in i29\) . Then, as Nit oo, the sequential IWFj^ described in Algorithm 
converges linearly to the unique NE of game ^ , for any set of initial conditions in 

Proof. See Appendix |El ■ 

Remark 5 — Sequential versus simultaneous IWFA. Since both simultaneous IWFAs in Algo- 
rithms [3] and H] are still based on the waterfilling solution ([6|), they keep the most appealing features 
of the sequential IWFA, namely its low-complexity and distributed nature. In fact, as in the sequen- 
tial IWFA, also in the simultaneous IWFA each user only needs to locally measure the PSD of the 
interference received from the other users and water-pour over this level. In addition, thanks to the 
Jacobi-based update, all the users are allowed to choose their optimal power allocation simultaneously. 
Hence, the simultaneous IWFA is expected to be faster than the sequential IWFA, especially if the 
number of active users in the network is large. We formalize this intuition in the next section. 



^Condition (|30|l is sufficient aiso for the convergence of tfie smootfied simuitaneous IWFA described in Aigoritlim |4l 



provided tliat tfie second fiand-side of (|30p is replaced by e = i_mhi''g" 2'' — ^- 



4.1.3 Asymptotic convergence rate 

In this section we provide an upper bound of the convergence rate of both (smoothed) sequential and 
simultaneous IWFAs. The convergence rate can be either measured on the average or for the worst 
possible initial vector p^'^). In the following we focus on the latter approach, introducing the asymptotic 
convergence exponent. 

Denoting by p* and p^"^ the NE of game ^ and the power allocation vector obtained by the proposed 
algorithm at the n-th iteration, respectively, the distance between p^") and p* can be measured by some 
vector norm Hp^"-* ~ P*|| ' which is to be compared with the initial distance Ijp^'^^ — p*|| . This leads to 
the following asymptotic convergence exponent for the worst-case convergence rate 




d=- sup lim_ 3 In ( jj^^^^ ^ ) . (31) 



.W-r.-^ -Ce"'^", (32) 



Since for large n 

P ' - P 

where C is a constant that depends on the initial conditions, d gives the (asymptotic) number of 
iterations for the error to decrease by the factor 1/e (for the worst possible initial vector). 

Since the waterfilling operator is not a monotone mapping, only (upper) bounds for the asymptotic 
convergence exponent can be obtained |41l I42j . as given in the following. 



Proposition 1 Lei dg"^ and dl°^ be lower bound of d in \31\) obtained using (smoothed) sequential 
IWFA in Algorithmic and (smoothed) simultaneous IWFA in Algorithm\^ respectively. Under condition 
liCS^) of Corollary \^ we have 



dll"^ = - log I max { 



a +(l_a)Iiy max /[S^^l 



dsim ~ Q dseq ■ (34) 

Proof. The proof follows directly from Proposition 2 in Appendix ■ 



Remark 6 — Convergence speed. Expression (j33p shows that the convergence speed of the algo- 
rithms depends, as expected, on the memory factors {a^lggf^ and on the level of interference. Given 
{o.q}qen, the convergence speed increases as the interference level decreases. Since and dl°^ are 
only bounds of the asymptotic convergence exponent, a comparison between the sequential IWFA and 
the simultaneous IWFA by d^?^ and dl°^ might not be fair. These bound becomes meaningful if dl°^ 
and dl°^ approximate with equality dsim and dseq, respectively, for some initial conditions (cf. [IT]). 

In Figure [3] we compare the performance of the sequential and simultaneous IWFA, in terms of 
convergence speed. We consider a network composed of 50 links and we show the rate evolution of 
three of the links corresponding to the sequential IWFA and simultaneous IWFA as a function of the 
iteration index n as defined in Algorithms [1] and [3l To make the figure not excessively overcrowded, 
we report only the curves of 3 out of 50 links. As expected, the sequential IWFA is slower than the 
simultaneous IWFA, especially if the number of active links Q is large, since each user is forced to wait 
for all the users scheduled in advance, before updating its own power allocation. 
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Figure 3: Rates of the users versus iterations: sequential IWFA (solid line curves), simultaneous IWFA (dashed line curves), 



Q = 50, 7 = 2.5, drq = dqr, drr 



1, = P^, r, = 1, Pq/al = 7dB, Pr/{<TldT.q) = 3dB, Vr,g G n 



4.2 Distributed algorithms based on gradient projection 

In this section we propose two alternative distributed algorithms based on the Gradient Projection 
mapping. The first algorithm is an instance of the Jacobi scheme, whereas the second one is based 
on the Gauss-Seidel procedure. Both algorithms come out from an interesting interpretation of the 
Nash equilibria in ([5]) as solutions to a proper Nonlinear Variational Inequality (NVI) problem [30^ 
Sec. 1.4.2], as we show next. The NVI problem is defined as follows. Given a subset X C M" of the 
Euclidean n-th dimensional space M" and a mapping f : M" M", the (nonlinear) variational inequality 
is to find a vector x* € Af such that [30, Def. 1.1.1] 

(NVI) (x - x*)^ f (x*) > 0, Vx G X. (35) 

All the Nash equilibria in ([5]) of game can be written as solutions to a NVI problem. In fact, 
a feasible strategy profile p* = [p*^, . . . , p^ satisfies ([5]) if and only if the following necessary and 
sufficient optimality conditions hold true [45] |^ 



{p,-Plf{-VgRM, p%)) > 0, Vp, € ^g, yq G n, 



(36) 



where Vgi?q(p*, p* g) denotes the gradient vector of Rg with respect to Pg, evaluated in (p*,p*g), and 
^g is defined in Comparing ([35|) with ([36]) . it is straightforward to see that a strategy profile p* 
is a NE of ^ if and only if it is a solution to the NVI problem (j35p . with the following identifications: 



X = XiX...xXq 

T 



X 



Xi 



1 1 • • • 



^ = X . . . X . 

P= [pJ,---,PQf 



^qRq{Pq,P- 



qh 



(37) 
(38) 
(39) 



^"observe that, given the strategy profiles of the other players, the optimal strategy of each player is a solution to the 
convex optimization problem defined in ((T}, whose optimality conditions, for any given q and p* g, can be written as in 



Ml [451 Sec. 4.2.3] 



where /g(x) denotes the g-th component of f(x) = [/i(x), . . . , /q(x)]^. In fact, if ([36|) is satisfied 
for each g, then summing over 5, (j35p follows. Conversely, assume that (j35p holds true for some 
p*. Then, for any given q, choosing p_g = p*^ and p^ G J^g, we have p = (Pg,p*q) G =^ and 
(P-P*f [-Vf/?i(p*),...,-V^iiQ(p^)]^ = (p,-p*)'^(-V,i?,(p^,p*_,)) > 0, Vp, G 

Building on the equivalence between psp and (j36p [301 [32], we can obtain distributed algorithms 
that reach the Nash equilibria of game ^ by looking for algorithms that solve the NVI problem in ()35p . 
A similar approach was already followed in |23J, where the equivalence between the Nash equilibria of 
a DSL game that is a special case of ^ and the solutions to a proper nonlinear complementary problem 
[3Uj was shown. However, the algorithms proposed in [23J to compute the NE solutions, in general, 
cannot be implemented in a distributed way, since they need a centralized control having access to all 
channel state information and to the PSD of all users. Differently from [23], we exploit the equivalence 
between ()35p and (|36p and propose two alternative totally distributed algorithms that do not require 
any centralized control to be implemented and have the same computational complexity as the IWFAs. 
To this end, we need the following intermediate result that comes directly from the NVI formulation 
in (IMI) i2 Prop. 5.1]. 

Lemma 2 Let (3 he a positive scalar and {Gglggfj he a set of symmetric positive definite matrices. 

A vector p* = [p^"^, . . . jPg^]"^ is a NE of game ^ if and only if it is a fixed point of the following 

mapping)^ p, 

p^ = TGg(p*)= [p^ + /3G-iV,i2,(p*,p%)]^^^, VgGO, (40) 

where Vy^^ is the Euclidean projection on with respect to the vector norm ||x||q_^ = (x^Ggx)^/^. 

Given Lemma [21 to reach the Nash equilibria of it is natural to employ an iterative algorithm, 
based either on Jacobi or Gauss-Seidel schemes, using as best response for each user the mapping in 
(|40p . Specifically, if the mapping in (|40p is used in the Jacobi scheme, we obtain the simultaneous 
Iterative Gradient Projection Algorithm (IGPA), as described in Algorithm [5] 

Algorithm 5: Simultaneous Iterative Gradient Projection Algorithm 

Set pf^ = any feasible power allocation, G O, and /? > 0; 



for n = : Nit 

p(") + /3G-iV,i?,(p("))l Vg G n, (41) 



p(n+l) 



end 



The sequential update of the strategies from the players can be easily obtained from ([40]l by using the 
Gauss-Seidel scheme, and provides the sequential IGPA, as given in Algorithm [6] 

^^The mapping in (140 |l always admits at least one fixed point, since it satisfies Brouwer's fixed point theorem |48l 
Theorem 4.2.5]. In fact, each set is compact and convex, and the mapping Tg(p) = (Tgj {p))q^n in (|40[) is continuous 
on 0^ = X ■ ■ ■ X ^Q, since each -Rq(p) is a continuously diflerentiable function of p and the projector operator is 
continuous as well [31 Prop. 3.2c]. 



Algorithm 6: Sequential Iterative Gradient Projection Algorithm 

Set pq^^ = any feasible power allocation, Vg G il, and /? > 0; 
for n = : Nit 



p(n+l) 



pj") + /3G-iV,i?, (p(")) if (n + l)modQ = 9, 



otherwise, 



yq e ft; 



(42) 



end 



The positive constant P and the set of (positive definite) matrices {GgjggQ are free parameters that 
affect the convergence property of the algorithms [3] • Sufficient conditions for the convergence of both 
sequential and simultaneous IGPAs are given in Appendix iFl 

Remark 7 — Computation of the projection. According to the best response mapping defined 
in ()4ip and ()42p . both Algorithms [5] and [6] require, at each iteration, the computation of the Euclidean 



projection 



on the feasible strategy set given in ([2]), with respect to the norm 



given P, {Gqjq^n, and p 



Pq] € the projections in (jlTj) and written as 



Tg,(p) = [p, - /3G-V,(P)] J , with /,(p) = -VqRqip 



can be computed solving the following convex quadratic programming: 



For any 



(43) 



mimmize 



-p'^MqPq + p^qq 



subject to Pq G 



(44) 



(45) 



where 

Mq^{l/P)Gq, and q, = q,(p) = /,(p) - (l//3)G,p„ 

with /g(p) given in (03]). Observe that in the special case of Gq = I, the mapping TGg(p) in (P3|) be- 
comes the classical Euclidean projection on the set ^q, that can be efficiently computed as a waterfilling 
solution, as shown in Section [3.11 (cf. Lemma [1]). 

Interestingly, to compute the projection in (jl3|) . a variety of alternative algorithms can be obtained, 
interpreting TGg(p) in (j43p as the unique solution to a proper Linear Variational Inequality (LVI) 
problem [3], as we show next. Using the scaled projection Theorem [3, Prop. 3.7(b)] 1^ one can find 
that the projection TGq{p) in can be equivalently defined as the unique vector p* G ^q such that 

(y, - pp^G, (p, - /3G-V,(P) - P^) < 0, Vy, G ^„ 



which, since /3 > 0, can be rewritten as 

(LVI) (y. 



Mgp* + qq) > 0, Vyg G 



9' 



(46) 



^■^The scaled projection theorem says that, given some x £ R" and a convex set X C R", a vector z £ X is equal to 
[x]^ if and only if (y — z)'^G(y — z) < 0, Vy G X, where [x]^ denotes the Euclidean projection of x on A" with respect 
to the norm |M|„ . 



1.1 




Simultaneous IGPA 
Simultaneous IWFA 



0.5' ' ' ' ' ' 

1 5 10 15 20 25 

iterations 

Figure 4: Users' rates versus iterations; simultaneous IGPA (solid line curves) and simultaneous IWFA (dashed-line curves), 
Q = 35, 7 = 2.5, drq = V. <^rr = dgg = 1, Pg = P^,rg = 1, P,/(t2 = 7dB, Pr/{aldfq) = 3dB, Vr, q £ Q. 

where Mg and are defined in (j45p . Inequality in (j46p still defines a variational inequality problem (see 
(|35|) ). but computationally simpler than the original one given in (j36p . as the function Mgp* + qg(p) 
in (j46p . for any given p, is linear in p*. Observe that, since Gq (and thus M^) is positive definite, the 
(unique) solution p* in ([l6j) is well-defined, as LVI in (jl6]) admits a unique solution [3l Prop. 5.5]. A 
variety of algorithms, known in the literature as linearized algorithms can be used to efficiently solve 
the LVI in The interested reader may refer to [301 ESI EI] ^oi a broad overview of these algorithms. 

Remark 8 — Distributed nature of the algorithms. Interestingly, both IGPAs keep the most 
appealing features of IWFAs, namely its low-complexity distributed nature. In fact, as in IWFAs, also 
in IGPAs each user needs only to locally measure the PSD of the overall interference received from the 
other users and project a vector that depends on this interference (i.e., Vg-Rq(p)) onto its own feasible 
set. 

Numerical Example. As an example, in Figured we compare the performance of the simultaneous 
IGPA with the simultaneous IWFA, in terms of convergence speed. We consider a network composed of 
(5 = 35 active users and compare the rate evolution of 3 out of 35 links as a function of the iteration index 
n, as defined in Algorithms [3] and [5j Interestingly, the simultaneous IGPA shows similar convergence 
speed than simultaneous IWFA. Thus, it can be used as a valid alternative to the simultaneous IWFA. 

5 Conclusions 

In this two-part paper, we have formulated the problem of finding the optimal linear transceivers in a 
multipoint-to-multipoint wideband network, as a strategic non-cooperative game. We first considered 
the theoretical problem of maximizing mutual information on each link, given constraints on the spectral 
mask and transmit power. Then, to accommodate for practical implementation aspects, we focused on 
the competitive maximization of the transmission rate on each link, using finite order constellations, 
under the same constraints as above plus a constraint on the average error probability. In Part I of the 
paper we fully characterized both games by providing a unified expression for the optimal structure of 



the linear transceivers and deriving conditions for the uniqueness of the NE. In this Part II, we have 
focused on how to reach these equihbria using totahy decentrahzed algorithms. We have proposed and 
analyzed alternative distributed iterative algorithms along with their convergence conditions, namely: 
1) the sequential IWFA, which is a generalization of the well-known (sequential) iterative waterfilling 
algorithm proposed by Yu et. al. to the case where spectral mask constraints are incorporated in 
the optimization; and 2) the simultaneous IWFA, which has been shown to converge faster than the 
sequential IWFA; 3) The sequential and simultaneous IGPAs, which are based on the gradient projection 
best response, and establish an interesting link between the Nash equilibria of the game and the 
solutions to the corresponding variational inequality problem. Interestingly, the simultaneous IGPA 
has been shown to have approximately the same convergence speed and computational complexity 
of the simultaneous IWFA, and thus it can be a valid alternative to the algorithms based on the 
waterfilling solutions. We have derived the sufficient conditions for the global convergence of all the 
proposed algorithms that, although proved under stronger constraints (e.g., the additional spectral mask 
constraint), have broader validity than the convergence conditions known in the current literature for 
the sequential IWFA proposed by Yu et al. 

We are currently investigating the extension of the proposed algorithms to the case in which the 
updating strategies are performed in a totally asynchronous way [19]. The other major extension that 
needs to be addressed is the situation where the channels and interference covariance matrices are 
known only within an inevitable estimation error. 

6 Appendix 

A Proof of Lemma [1] 

First of all, observe that the objective function of the convex problem (jlOp is coercive on the feasible set 
[17] . Hence, a solution for the problem pUj) exists [17|. Since problem (flUj) satisfies Slater's condition 
[45tl47j. the Karush-Kuhn- Tucker (KKT) conditions are both necessary and sufficient for the optimality. 
The Lagrangian corresponding to the constrained convex problem (jlOp is 



k=l k=l \ k=l ) k=l 



kixk-p^^'^ik)), (47) 



and the KKT conditions are: 



Xk+xo,k - i^fc + Afe - /u = 0, 

T^k > 0, Xk> 0, VkXk = 0, 

Afc>0, Xfc <p'^^^(A:), Afc(xfc-p"^^^(A:)) = 0, ke{l,...,N}, (48) 
1 ^ 

-T^Y.Xk = l. 

k=i 



with xo,fc > 0, VA; G {1, . . . , A^} and ^ = ^/N. 



Observe that, if (1/iV) Efc=i^'"'^''(^) < 1 

or p™^^(A;) < for some k, then the problem is infeasible; if 



(1/iV) Y.k=iP^'^'^{k) = 1' t'^sii tlie problem admits the trivial solution Xk = p^'^'^ik), V/c; if p™''''(/c) = 



for some k, then Xk = 0. Here after, we thus assume that all the subcarrier indices corresponding to the 

{k) > 1 (to avoid the trivial solution). 



zero- valued p'^''^^[k) s have been removed and {l/N) ^^LiP™'^^ 



First of all, observe that ^ > (in fact, /i < is not admissible, since the constraint (1/A^) X^^Li = 
1 necessarily implies Xk= —xq^^ — + > 0, for at least one A;). If x^ = 0, since z^fc > and (by 
the complementary slackness condition) = 0, then /i — xq ^ = —V}^ < 0. If < Xj^ < p^^^{k)^ then 
z^fc = and = 0; which provides x^ = ^ — a^o,^, (observe that < ;U — XQ j. < p^^^{k)). Finally, if 
Xk = p^^^{k), then 1/^ = and A^ > 0, which implies /i — xo,fc ^ p™^^(fc). Since, for each k, the values 
of the admissible solution induce a partition on the set of the /i values, the solution can be written as 



Xk = [/i 



0, if /i - xo,fc < 0, 

p"^^^(A:), if /X - xo^k > p^'^'^ik), 



where /i is chosen so that (1/A^) X^^Li [a* 



^(fc) 



A;G{l,...,iV}, 



B Properties of Waterfilling Projection 



First of all, it is convenient to rewrite the waterfilling operator in ()16p as 



WF,(p_,) 



(49) 



where 



H 



r„ diag 



Hrq(l) 



\Hqq{l] 



|2 ' 



\Hrq{N)\ 
\Hqq{N)\ 



\/r^q,qen, 



(50) 



l/\H,,{l)\\...,l/\H,,{N)f 



and is defined in ([2]). 

Building on (|49p . we derive now a key property of the waterfilling operator that will be fundamental 
in proving Theorems [1] and [3l To this end, we introduce the following mapping. For technical reasons, 
we first define 

(51) 

[ 0, otherwise, 

where Vq is given in (jlSp . and introduce the admissible set ^'^^ = 1^^^ x • • • x C where 
is the subset of containing all the feasible power allocations of user g, with zero power over the 
carriers that user q would never use, for the given power budget and interference level, in any of its 
waterfilling solutions ([U]), against any admissible strategy of the others: 



o^f 4 {pg G with pq(k) = Q\/k(f: Vg} 

( I ^ 

= lpq€ : - J2pq{k) = 1, < Pg{k) < 

[ k=l 



p^-(A:), yke{i,...,N}\, 



(52) 



where the second equality in (j52p follows from the properties of the waterfilling solution ([6]) (cf. Ap- 
pendix!^. Observe that, because of (j52p . the game does not change if we use J^'^^ instead of the 



original . For any given {oglggf^ with € [0,1), let T(p) = (Tg(p))ggf7 : 3^""^^ be the 
mapping defined, for each q, as 



Tg(p) = Pg + (1 - Og) 



Og Pg + (1 - Og) 



-<Tg — ^ HrgPr 

-<Tg — HrgPr 



, p e 



«efT 



Og G [0,1), 



(53) 



where the second equality follows from (j52p . Observe that the operator in (j53p is indeed a mapping from 
^eflf ^eff^ i^^g convexity of Moreover, it follows from (j49p that all the Nash equilibria 

p* = (Pg)ggf^ of game in ([1]) (see (fT7|) ) satisfy the following set of equations 



(54) 



^cff 



which correspond to the fixed points in .f^^^^ of the mapping T defined in (j53p . Hence, the existence of 
at least one fixed point for T is guaranteed by the existence of a NE for game [U Theorem 2]. 

Before proving the main property of the mapping T, we need the following intermediated definitions. 
Given T in ([53l) and some w = [wq, . . . ,wq]'^ > 0, let |Hl2'biock denote the (vector) block-maximum 
norm, defined as [3] 

'|T,(P)L 



I T(p) Unblock = max 



qen Wq 

where H-L is the Euclidean norm. Let IHI^^pf, be the vector weighted maximum norm, defined as 



(55) 



|x|l^,vcc - max w > 0, X G 

' q&il Wq 



(56) 



and let |H|^ j^^t denote the matrix norm induced by 

Q 



I w 

loo,vec ' 



I A IIW A 

|A|L,^,t = max^^ 



, defined as [441 

— y[A]qrWr, AGR^^xQ, 



(57) 



r=l 



Finally, define |Hl2Dg ^ 



1/2 



X G 



(58) 



,kev„ 



with Dq defined in psp . Observe that |Hl2X)^ is not a vector norm (as does not satisfy the positivity 
property), but it is a vector seminorm 

The mapping T in (j53p is said to be a block- contraction with modulus /?, with respect to the norm 
IHITbiock m ([55]) . if there exists /3 G [0, 1) such that [H Sec. 3.1.2] 



T(p«)-T(p(2); 



2,block 



</5 



p«-p(2) 



w 

2,block 



Vp«,p(2)G 



q ■ 



(59) 



We provide now some interesting properties for the mapping T in ()53p . that will be instrumental to 
study the convergence of both sequential and simultaneous IWFAs. 



Lemma 3 (Nonexpansive property of the waterfilling mapping) Given Vg, , and || ||2X'q 
defined in I118\) . L5^) and ^5^). respectively, let [•]" '[^ denote the projector operator onto the convex set 

S^q^ with respect to the vector norm \\-\\. Then, [•]" '[.^ satisfies the following nonexpansive property: 



X 



< !|x-y| 



Vx,y e 



(60) 



Proof. For any given e > 0, let ||-||2 denote the weighted vector norm (derived from an inner product 
defined as 



1/2 



X G 



e > 0. 



(61) 



Then the projector [ • ] ' satisfies the following inequality: 



X 



1 



I rOlcS 
1 



e 



X 



1 



I ^acff 
<3 



<||x-y|r2^ Vx,y GE^, Ve>0, (62) 



where the equality in ([62]) follows from [xo]^j,g = [xol^^ff ' > due to the equivalence between the 

^ q q 

optimization problem (jlOp and the same problem where the original objective function is replaced by 

(||x - (-xo)||2_x>,)^ = EfceD, (a^fc + XQ^kf + ^Efc^D, {-x^^kf (since any x ^&>f is such that = 

IMI21? 

0, Vfc ^ Pq); and the inequality in (l62l) represents the nonexpansion property of the projector [ • ] ' 
in the norm ||-||2x>g |2l Prop. 3.2(c)] 

Since ||-||2 -p is a continuous function of e > 0, taking in ()62p the limit as e — 0, and using 



lim IMIo Ti 



l2,X>„ ' 



we obtain the desired inequality, as stated in ([60]). ■ 

Proposition 2 (Contraction property of mapping T) Given w = [tui, . . . ^wq^> 0, assume that 
the following condition is satisfied: 

1 1 jjmax 1 1 w 



where H" 



oo.mat < 1' (63) 

anc? II ■ 11^ mat '^'"6 defined in [W\) and (5l\ ), respectively. Then, the mapping T defined in 
is a block- contraction with modulus (3 = ||H™'^^||^ j^g^^, with respect to the block-maximum norm 
•|l2[biock defined in (55^. 



Proof. The proof consists in showing that, under ()63p . the mapping T satisfies (I59p . with /3 
HH'^'^'^II^. Given p^^) = (pj^\ . . ■ ,P^q^) G and p(2) = (pf\ . . . ,p[j^^) G define 



T.(P«: 



T,(p(')) 



and e„ 



p(i) 



g G 0. 



(64) 



^■^Observe that the nonexpansive property of the projector, usually given in the Euclidean norm, is preserved in any 
vector norm (derived from an inner product) used to define the projection. 



Then, we have: 



< On 



p(2) 



+ (1 



an 



^cff 



(2) 
r 



'^9 



aqCq + (1 - 



< agCg + (1 - Oq 



agCq + (1 - aq) 



r^q 



cacff 



2) 



2,X>o 



2,Oo 



< UqCq + (1 - Oq 



Eh.4p«-pF 



p1^) - p(^) 



a. 



+ (1 - Q,) E r, ™p [Hrglfcfc) e., Vp(i), p(2)G^-fr, V(? G fi, 



(65) 
(66) 

(67) 

(68) 

(69) 
(70) 
(71) 



where (j66p follows from (j53p and the triangle inequality fH]; in (j67p |Hl2X'q defined in (j58p and the 
equality follows from the fact that sets to zero the elements not in T)q; (j68p follows from Lemma 



[3] (see ()60p ): in ()69p H,.q is a diagonal matrix defined as 



^ J [H^g]^^ , if G n Pg, 



(72) 



0, otherwise. 
Introducing the vectors 

ex = [cti,--- jEtq]^, and e = [ei, . . . , cq]"^, (73) 

with and Cg defined in (j64p . and the matrix 

^ D„ + (I - D„)H--, with D„^diag(aq...aQ), (74) 

where H™'^^ is defined in (|19p. Then, the set of inequalities in (|7ip for all g, can be rewritten in vectorial 
form as 

0<eT<HS^"e, Vp(i), p(2)Gi^<=ff (75) 
Using the weighted maximum norm |H|^vec defined in ()56|) in combination with ()75p . we have, Vp^"*^-*, 

lleT-ir < llH™''''eir < llH™''''ir Heir f76) 

II J- lloOjVCC — ll-^-^Q! Iloo,vcc — ll-^-^Q: lloo,mat II lloOjVec ' V'"/ 

where ||'||^ j^a^^ is the matrix norm induced by the vector norm ||'||^ in (fSep and defined in ([STD [44J. 
Finally, using ([76]) and ([551), we obtain, Vp(^), p(2)G^efTj^j^j > 

Tq(p«)-Tq(p(^); 



T(p«)-T(p 



(2)- 



= max ■ 

2,block q 



ex 



g 

w 



< "trinax w p Trmax w 

— Il-'^ct I loo, mat II lloo,vec II -"^O! I loo, mat 



p«-p(2) 



2,block 



(77) 



which leads to a block-contraction for the mapping T, if ||H™^^||^ j^^^^^. < 1, implying condition ([63 
(since each Uq G [0, 1), G fi). I 



C Proof of Theorem [T] and Theorem [2] 

Since the sequential IWFA described in Algorithm [1] is an instance of the smoothed sequential IWFA 
given in Algorithm [2] when = for all g G fi, to prove convergence of both algorithms, it is sufficient 
to show that Algorithm [21 under condition ([Cip . globally converges to the NE of game ^ , for any given 
set {aglggQ, provided that each aq G [0, 1). We thus focus in the following only on Algorithm[2l w.l.o.g.. 

It follows from Corollary [3] and (|53p that Algorithm [2] is just an instance of the Gauss-Seidel scheme 
based on the mapping T, defined in ([53p . Observe that, to study the convergence of Algorithm [21 
there is no loss of generality in considering the mapping T defined in i3^^^ C instead of , since 
all the points produced by the algorithm (except possibly the initial point, which does not affect the 
convergence of the algorithm in the subsequent iterations) as well as the Nash equilibria of the game 
are confined, by definition, in ^^^^ (see (jl7p in Appendix [Bj. Convergence of the Gauss-Seidel scheme 
based on the mapping T is given by the following result that comes from [3, Prop. 1.4]^ and [3l Prop. 
1.1a)]. 

Proposition 3 // the mapping T : ^'^B^g^f^ff defined in / f53|) is a block- contraction with respect to 
some vector norm, then: 1) The mapping T has a unique fixed point in ^'^S ■ 2) The sequence of 
vectors starting from any arbitrary point in 3^^^ ^fid generated by the Gauss-Seidel algorithm based on 
the mapping T, converges linearly to the fixed point ofT. 

It follows from Proposition [21 and Proposition [3l that the global convergence of Algorithm [2] is 
guaranteed under the sufficient condition given in ([63]) . Moreover, since ([63]) does not depend on 
{a}q(=n, the convergence of the algorithm is not affected by the particular choice of a^'s as well [provided 
that each Oq G [0, 1)]. 

To complete the proof, we just need to show that (|63p is equivalent to ([Cip . Since H™^'^ is a 
nonnegative matrix, there exists a positive vector w such that [H Corollary 6.1] 

l|H"^'1l^,^at < 1 ^ p(H--)<l. (78) 

Since the convergence of Algorithm [21 is guaranteed under ([63p . for any given > 0, we can choose 
w =w and use ([78p : which proves the desired result. 

Conditions ([C2p - (|C3P in Corollary [H can be obtained as follows. Using [3l Proposition 6.2e] 

p(H--) < IIH--II- Vw>0, (79) 
a sufficient condition for the =^ direction in ([78p is 

^''observe that the set defined in (|52p is closed, as required in [S] Prop. 1.4]. 



for some given w > 0; which provides ()C2p . The optimal vector w is given by the following Geometric 
Programming [45j 

minimize max > [H.^'^^] w~^Wr 

w q ^ — ' ^ ^ 

r=l,ry^q 

subject to w > 0, 

which provides (j2ip . 

Condition ^2i) is obtained similarly, still using ([78]) and p(H™^^) = p (h™^^^) . ■ 

D Proof of Theorem [3] 

The proof is based on the same steps as in Appendix [O It follows from [3l Prop. 1.1] that both 
Algorithm [3] and Algorithm U] linearly converge to the unique NE of game starting from any arbitrary 
point in if the mapping T defined in (j53p is a contraction (see (j59p ) in some vector norm. Using 
the block- maximum norm as defined in (|55p . invoking Proposition [51 and following the same approach 
as in Appendix O we obtain the desired sufficient condition (jCip for the global convergence of both 
Algorithm 3 and Algorithm 4, for any given set {a}q(zQ, [provided that each aq G [0, 1)]. ■ 



E Proof of Theorem [4] 

Since Algorithm [3] is an instance of Algorithm H] when each aq = 0, we focus only on the latter, w.l.o.g.. 
The proof of the theorem is based on (pseudo) contraction arguments, similarly to what we already 
shown in Appendix [Cj The main difference with respect to the approach proposed in Appendix [C] is 
due to the alternative definition of the error vector generated by Algorithm as detailed next. 

Denoting by p^"^ = {p^^)q^Q the power allocation vector generated by Algorithm [J] at iteration 
n > 1, with arbitrary starting point p^*^) G and using the mapping T defined in (j53p . we have 

p("+i) =T(p(")), Vn>l. (81) 

Let p* = (Pq)^gQ be a NE of game ^ (and thus a fixed point of the mapping T), whose existence is 
guaranteed by [H Theorem 2]. Define the vector e*^"^ = [e^"^"^, . . . ,eQ^^]"^, with 

e(n) A p(n) _ n > 1 and g G (82) 
and, given {aqjqeQ with Og G [0, 1), define the Q x Q matrix Dq, = diag(Qi . . . ag). Then, for each 



n > 1, we have 



< 



+ 



(1 - ai) (cti + Er^lHrlP 



An) 



-(1 - aq) (cTQ + Er^QH,.Qp^"^ 

(1 -Qi) (cTi +X;^_^iHrlp: 



^cff 



(83) 



< 



^AfJll2,mat 



(1 - ai) E r^l Hrl (P^^ - P 
(1 - "O) E r^gHrQ (pl''^ - Pj 

I,v ® (Iq - D«))HPe(") 



+ 



< 



|(D« ® Iiv)|l2,mat + IKlAf ® " D«)|l2, 



llTTll 

mat II 1 1 2, mat 



.(1) 



(84) 

(85) 
(86) 



where (|83p fohows from the triangle inequahty and J!^'^^ = x • • • x where "(8)" denotes 

the Kronecker product; (j84p fohows from the nonexpansive property of the Euchdean projector, the 
definition of 1!^'^^ (see (I52p ) and the definition of the diagonal matrices H^g, as given in (172^ : and 
in (j85[) we have used a permutation matrix P so that the vector e*^"^ given in (I82p . is replaced by 



Pe(") 



-1 ) • • • ) 



4")^]^, with 4 bS"^(A;),...,pS3")(A:)]^ - [pt(A:),...,p^Q(fc)]^, and the 



matrix H is defined as 



H^diag(H(l),...,H(iV)), (87) 

with H(A;) given in (j29p . The matrix norm ||H||2j^3^j in (j84p is the spectral norm (induced by the 
vector Euclidean norm [44]), defined as ||H||2jjj3^^ = 

From (j86p it follows that Algorithm 2] converges to the NE p*, from any starting point p^"^ G if 



(D. 



'I^)ll2.mat + ll(I^®(lQ-D 



"''Il2,mat 



IHI 



2, mat 



in (j86p approaches to zero as n ^ oo, which 



is guaranteed if the following conditions are satisfied 

1 - maxqgf^ aq 



P 



V2 (H^(A:)H(A:)) < ^ 



, VA; €{!,..., iV}, 



which provides the desired result. Given ([86|) . the linear convergence of the algorithm follows directly 
from m Sec. 1.3.1]. ■ 

F Proof of Convergence of Algorithm [5] and Algorithm [6] 

The global convergence of both sequential and simultaneous IGPAs, described in Algorithms [5] and 
m is guaranteed if Algorithms [5] and [6] satisfy [3l Prop. 1.1] and [3)j Prop. 1.4], respectively. To this 
end, since each HPq is compact (and thus also ^ = x • • • x S^q), it is sufficient that the mapping 
Tg(p) = (TQ^(p))ggQ : 3^ ^ IS a. block-contraction (see ([59P ) with respect to the norm ||-||q j^j^^j^ , 
defined as 



|TG(p)|lG,biock = max||TG,(p)|l 



G3,2 ' 



with IITgJp) 



(T^ (p)G,Tg,(p)) 



1/2 



where Tg„(p) is defined as 



TGg(p) = [Pq - /3Gq V<?(P<?, P-q)] , with /g(pg, p_g) = -VqRq{pq, p_g). 

Rewriting Tg, (p) in ([89]) as 

TGg(p) = [R-Gg(p)]5^ , with Rg,(p) =Pq- PG~^ fqiPq, P-q) , 

and using ([88]), we obtain: Vp(^),p(2) G 



(89) 



(90) 



Tg(p('))-Tg(p 



(2)^ 



G. block 



= max 

< max 



RG,(p(')) 



Rg,(p(^))-Rg,(p^'^) 



RG.(p(^)) 



(2)^ 



Rg(p^'))-Rg(p 



(2)^ 



(91) 

G, block 



where the inequahty follows from the non-expansive property of the projection [•] in the norm || • ||g 2- 



Prom (j91|) it follows that a sufficient condition for Tg being a contraction with respect to the norm 



lG,bIock 



m 



is that the mapping Rg(p) — (RGi(p))gen : ^ 



defined in (1901) be a 



contraction with respect to the same norm. 

We derive now sufficient conditions for Rg(p) being a contraction with respect to ||'||g block 

defined 

in (j88|) . For the sake of simplicity, we will consider only the case in which Gg = 1, Vg € 0. 

We introduce the following notation: For any /g(p) = — Vgi?g(pg, p__g), let Vr/g(p) denote the 
N X N matrix, whose j-th column is the gradient vector of the j-th component of fq{p), when viewed 
as function of Pr- Then, we have the following result that comes directly from [3l Prop. 1.10]. 

Proposition 4 As Nit — > cxd, the IGPAs described in Algorithms\^ and\^ converge to the unique NE 
of game ^ from any set of initial conditions in if there exists a scalar 6 € [0, 1) such that 



\i-(3'^qfqip)\kmat + T.mrfqip) 



1 2, mat 



< 6, Vp G ^, Vg G n, 



(92) 



where 11 A I 



2, mat 



denotes the spectral norm of the matrix A. 



We derive now a sufficient condition for ([92]) . Using /g(p) = —VqRq{pq,p-q) and we have 

V./,(p) = Bq{p)Urq, (93) 



where 



Hrq = diag ^1 



\Hrq{k)\^ 
\Hqq{kW 



Dg(p) = diag 



\Hqqik)\^ ' 



\Hrqm' 



^^^\Hqqik)\^P^^^\ 



Using ((Ml) , condition ([92]) becomes 



max|l-/3[Dg(p)];^J +/? Vmax[D,(p)H,,];^^ <6, Vp G ^, Vg G 



(94) 



(95) 



rj^q 



A sufficient condition for (1951) is 



It is straightforward to see that it is always possible to find proper (sufficiently small) /3 > and 
6 G [0, 1) (close to one) such that ([96]) is satisfied, provided that 

max [H^g]^^ = ^iY^ max ^^J''^^^ 3^ ^ < Vg G (97) 

r^q ^ r^q ^ \Hqq{k)\ ^^rq ^q 



where 



and D„(p) is defined in 



. minfc [Dg(p)]fcfe 

= mm , ,., < 1, (98) 

pe.^ maxfc [Dg(p)];,^ " ' ^ ^ 
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